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Abstract — As the Internet becomes severely overburdened with 
exponentially growing traffic demand, it becomes a general belief 
that a new generation data network is in urgent need today. 
However, standing at this crossroad, we find that we are in a 
situation that lacks a theory of network designing. This issue 
becomes even more serious as the recent progress of network 
measurement and modeling challenges the foundation of network 
research in the past decades. 

This paper tries to set up a scientific foundation for network 
designing by formalizing it as a multi-objective optimization 
process and quantifying the way different designing choices 
independently and collectively influence these objectives. A 
cartesian coordinate system is introduced to map the effect 
of each designing scheme to a coordinate. We investigated the 
achievable area of the network designing space and proved 
some boundary conditions. It is shown that different kind of 
networks display different shapes of achievable areas in the 
cartesian coordinate and exhibit different abilities to achieve 
cost-effective and scalable designing. In particular, we found that 
the philosophy underlying current empirical network designing 
and engineering fails to meet the cost-effective and evolvable 
requirements of network designing. We demonstrated that the 
efficient routing combined with effective betweenness based link 
bandwidth allocation scheme is a cost-effective and scalable 
design for BA-like scale-free networks, whereas if other designing 
choices cannot be determined beforehand, ER network is a 
markedly good candidate for cost-effective and scalable design. 

Index Terms — network designing, cost-effective network de- 
signing, scalable network designing, scale-free network, link 
bandwidth allocation 



I. Introduction 

THE Internet becomes more and more complex and sus- 
ceptible to congestion today. With the emergence of new 
data-intensive applications and fast growing population in need 
of data communication service, most experts agree that the 
existing data network architecture is severely stressed and 
approaching its capability limits [l]-[4]. Thus the move to 
a brand new next generation data network is in urgent need 
today. 

However, before this move, there lacks the theory of net- 
work designing, which becomes more serious as the recent 
surge of network measurement and modeling efforts show 
quite counter-intuitive result, that is, most networks, including 
the Internet, display scale-free structures rather than presum- 
ably random structures [5]-[7], [9]. This recent trend poses 
an urgent need as well as an opportunity for a fundamental 
understanding of the science underlying the network design- 
ing. The Network Science and Engineering Council(NetSE) 
has recommended the creation of "a fundamental science 
of networking that has the potential to underpin systematic 
engineering methodologies for the design of large-scale, ro- 
bust, cost-effective and evolvable networked systems" to be 
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among the top agenda for long-term network research [11]. 
The hardness of establishing this science primarily lies in 
the abstraction of network designing. Too simple model may 
lose its practicality, while too complicated model that includes 
too many details may lead to insolvability. At the heart of 
various designing considerations, a scientific approach should 
clarify the following key issues: what are the typical network 
designing objectives and major designing choices, what are the 
inherent interplays of these choices, how the designing choices 
independently and collectively affect different designing ob- 
jectives, and what are the achievable values of the designing 
objectives. 

From the macroscopic perspective, network designing often 
involves several independent yet closely related aspects, of 
which the three most fundamental ones are what kind of 
network topology shall be used, which routing algorithm 
is effective, and how the link bandwidth will be allocated. 
One can independently choose either ingredients as they are 
conceptually orthogonal to each other. However, when network 
designing objectives are concerned, these ingredients become 
closely interleaved because different combinations will pro- 
duce different effects. 

One major objective for network designing and engineering 
is to enhance the network's transmission capacity, a focus 
of a number of previous researches [12]— [14], [20]-[22]. 
However, we argue that most often, maximizing the network 
transmission capacity could not be the sole objective. A 
network designing is also typically constrained by the financial 
or technical restrictions, so a cost-effective design is preferable. 
In addition, as networks are constantly evolving, a designing 
scheme with good scalability will be more desirable. To 
summarize, previous researches have the following shortcom- 
ings : (1) they focus only on one aspect of the network 
designing ingredients, without a formally defined scientific 
framework ; (2) their studies are based on a traffic flow model 
that is too simple for network engineering; (3) they don't 
consider the technical feasibility of the designing scheme, 
which is crucial for network engineering; (4) they do not 
consider the scalability issue of a designing scheme, which 
is of significance in reality because networks are constantly 
evolving. 

Motivated by these incentives, in this paper, we carried 
out an in-depth study of network designing in a systematic 
approach. The main contributions are: 

1) We propose that network designing is a multi-objective 
optimization process, with often contradictive objec- 
tives, and formalize the framework for network design- 
ing study by defining a two dimensional cartesian coor- 
dinate system according to the two important designing 
objectives; 

2) We compare different designing schemes by quantita- 
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tively mapping the effect of each scheme to a coordinate 
in the coordinate system, and we explore the achievable 
areas in this coordinate system; 
3) We investigate the cost-effectiveness and scalability of 
some representative designing schemes. In particular, 
we find that the current network designing philoso- 
phy is neither cost-effective nor scalable. We propose 
one cost-effective and scalable designing scheme, i.e., 
efficient routing combined with effective betweenness 
based bandwidth allocation, which can achieve pretty 
good tradeoff between the two designing objectives for 
BA-like scale free networks, and we demonstrate that 
if one has the full freedom to select network designing 
choices, ER network is a remarkably good candidate to 
achieve a cost-effective and scalable design, especially 
when other designing ingredients cannot be determined 
beforehand. 

The subsequent paper is structured as follows: We briefly 
review related work in Section UD and introduce our modified 
traffic flow model in section [HI] The major network designing 
objectives and designing choices are presented in section |IV] 
We move forward to the definition and analytical analysis of 
the cartesian coordinate system and achievable areas in section 
[Vl The scalability issue of network designing is discussed in 
section |VT] Finally, we conclude the paper in section IVIII 

II. Related Work 

The past decade has witnessed a surge of network topology 
measurement and modeling research activities [5], [6], [8], 
[10], [23]-[27], which, with abundant evidence, show that 
most real networks, including the Internet, display scale- 
free network structure. This milestone finding challenges the 
foundation of network related studies, since for decades the 
network researches are grounded on the general assumption 
of random network model [33]. 

Recently, there is a research shift from the static network 
topology characterization and modeling to the traffic dynamics 
on these new networks [12]— [14], [19]-[22]. These work, 
primarily for enhancing the network's transmission capacity, 
can be broadly classified into three categories: to make small 
changes to the underlying network topology [12], [13], [20], 
to adjust the routing algorithm [21] and to customize the node 
capability [14], [22]. A straightforward way to improve the 
network's transmission capacity is to create new edges in the 
network [12], [13]. A somewhat counter-intuitive approach 
proposed in [20] is to enhance the BA network's transmission 
capacity by removing edges that connect nodes with high 
betweenness values, hence balancing the traffic across the 
network. In [21], the authors try to achieve the same goal 
by the so called efficient routing algorithm. Instead of routing 
packets along the path with minimum number of hops, efficient 
routing routes packets along the path that minimizes the sum 
of node degrees, hence bypassing those high degree nodes in 
BA network. In [22], the authors investigate this issue from 
yet another angle, i.e., to adjust the node capability. Two 
node capability models are studied in this work, i.e., with 
node capability proportional to its degree and betweenness 



respectively. These two capability models are further inves- 
tigated under the condition that the total capability remains 
fixed [14], [15], which essentially turns the problem of node 
capability assignment to an optimization problem. A central 
concept behind these analysis is the betweenness centrality 
[16]— [18], [32], which bridges the research of static network 
topology and dynamic network traffic analysis. Under certain 
circumstances, the betweenness centrality precisely estimates 
the traffic passing through a node or an edge [16]— [18], [21]. 



III. Traffic flow model 

In this study, we use a traffic-flow model that is slightly 
different from [20]-[22], [28] to more accurately model the 
network realism. Each node is capable of generating, for- 
warding and receiving packets. At each time step, R packets 
are generated at randomly selected sources. The destinations 
are also chosen randomly. Instead of assigning a single node 
capability to each node as previous studies do, we assign 
multiple link bandwidths to multiple interfaces of each node. 
Denote C(vi) to be the link bandwidth of the ith interface of 
node v, then C(vi) specifies the maximal number of packets 
this interface can deliver at a time step. We assume symmetric 
link bandwidth, i.e., for an edge e = (vi,Uj), we have 
C(vi) = C(uj), which is further denoted as C(e). Each packet 
is forwarded toward its destination based on the particular 
routing algorithm used. When the total number of arrived and 
newly created packets to be delivered at interface Vi exceeds 
C(vi), the packets are stored in this interface's queue and will 
be processed in the following time steps on a first-in-first- 
out(FIFO) basis. If there are several candidate paths for one 
packet, one is chosen randomly. Each interface has a queue 
for delivering newly arriving packets. Packets reaching their 
destinations are deleted from the system. As in [20]-[22], [28], 
interface buffer size in this traffic-flow model is set as infinite 
as it is not relevant to the occurrence of congestion. 

For small values of the packet generating rate R, the number 
of packets on the network is small so that every packet can 
be processed and delivered in time. Typically, after a short 
transient time, a steady state for the traffic flow is reached 
where, on average, the total numbers of packets created and 
delivered are equal, resulting in a free-flow state. For larger 
values of R, the number of packets created is more likely to 
exceed what the network can process in time. In this case 
traffic congestion occurs. As R is increased from zero, we 
expect to observe two phases: free flow for small R and a 
congested phase for large R with a phase transition from the 
former to the latter at the critical packet-generating rate R c . 

In order to measure R c , we use the order parameter [31] 
r] = Hindoo , where 9(i) is the total number of packets 
in the network at time t, A9 = 9(i + At) - 9(t), and (• • • ) 
indicates the average over time windows of At. For R < R c 
the network is in the free-flow state, then A9 « and r\ « 0; 
and for R > R c , A9 increases with At thus ?/ > 0. Therefore 
in our simulation we can determine R c as the transition point 
where 77 deviates from zero. 
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TABLE I: Elementary topological properties of the five net- 
works, each containing 1200 nodes. TV and M denote the 
number of nodes and number of edges respectively, D denotes 
the diameter , L denotes the average shortest path length. 

Network N M D L 



WS 1200 2400 15.5 7.86 

ER 1200 2450 11 5.23 

BA 1200 2390 8 4.43 

PA 1200 2400 8.7 4.03 

HOT 1200 2583 9 5.16 



IV. Network designing choices and objectives 

Of the various network designing choices, the following 
three ones are of critical relevance: routing algorithm, link 
bandwidth allocation scheme, and network topology. Design- 
ers could be faced with different scenarios characterized by the 
choices he can make. For example, when designing a brand 
new network from the very beginning, designers own the full 
freedom of choosing each ingredient, while when improving 
an existing network, designers can only change the routing 
algorithm or upgrade the link bandwidth. 

A specific designing is often associated with explicit design- 
ing objectives. In this paper, we consider three major network 
designing objectives: the network transmission capacity, which 
should be as large as possible, the maximal bandwidth required 
to realize the designing scheme, which should be as small as 
possible, and the scalability, which should be as scalable as 
possible. 

Enhancing network transmission capacity is one of the 
major goals for network designing and engineering, and is 
typically measured by the critical packet-generating rate R c 
[20]-[22], [28]. Although a simulation-based approach is 
feasible to measure R c , it is time-consuming. The betweenness 
centrality provides a means to analytically evaluate the R c 
under the shortest path routing algorithm. 

In our modified model, for any given network under shortest 
path routing, the expected number of packets passing through 
an edge is at one time step, with each half flowing 

towards one direction, where B(e) is the betweenness centrality 
of the edge e. So for interface Uj not to get congested, it 



follows that 

2C(vj)N(N-l) 
B(e) 



\ ' N(N-\) — C(vi), which leads to R < 
. As a result, the critical packet generating rate 



R c for shortest path routing is: 



R c 



min e€E - 



2C(e)N(N - 1) 



(1) 



where E corresponds to the edge set of the network. 

In the more general situation, for any topology-basea^ rout- 
ing algorithm T, we introduce the effective betweenness B r (e) 
(similar to the definition in [21]) to estimate the possible traffic 
passing through an edge under routing algorithm T, which is 

1 topology-based routing algorithm means routing decision is made solely 
on the static topological information, not on dynamic traffic information. 



formally defined as: 



E 



5 r (u, v) 



where V is the node set, S r (u,v) is the total number of 
candidate paths between node u and v under routing algorithm 
r, and S^(u,v) is the number of candidate paths that pass 
through edge e between u and v under routing algorithm V. 

Following this definition, the critical packet generating rate 
R c under routing algorithm T can be calculated as: 



Rc 



mvn eeE - 



2C(e)N{N - 1) 



(2) 



This equation also explains how different designing ingre- 
dients affect the R c . C(e) represents the bandwidth allocation 
scheme, and B r (e) is the collective effect of network topology 
and routing algorithm. 

The maximal bandwidth, denoted as C max , required to 
realize the R c is another designing objective of particular 
importance to network designing and engineering, because 
it directly relates to the technical feasibility(sometimes also 
monetary issue) of the specific designing scheme. By posing 
an upper bound of the required link bandwidth, it can be 
used to judge whether the proposed designing scheme can be 
realized with state-of-art technologies or available monetary 
budget. 

Scalability is an issue that becomes increasingly important 
because today's networks are all large-scale and constantly 
evolving. A scalable designing will have long-term benefit 
for the network investors and operators. In this study, the 
scalability will be measured by the growth trends of R c and 

Two typical routing algorithms are investigated in this paper: 
the shortest path routing, abbreviated as SPR, and the efficient 
routing [21], abbreviated as EFR. More formally, the efficient 
routing chooses a path s = Vo,Vi,V2, ■ ■ ■ ,Vk = t between 
s and t that minimizes the objective function J2o<i<k d(vi), 
where d(vi) is the vertex degree of V{. 

Three link bandwidth allocation schemes are analyzed: the 
uniform link bandwidth allocation scheme, denoted as UC, the 
betweenness based link bandwidth allocation scheme, denoted 
as BC , and the effective betweenness based link bandwidth 
allocation scheme, denoted as EBC. In UC, each link has the 
same bandwidth, while in BC and EBC, each link's bandwidth 
is proportional to its edge betweenness and effective be- 
tweenness respectively. For the purpose of comparing between 
different bandwidth allocation schemes, we keep the condition 
that the total link bandwidth assigned to all edges remains 
fixed, which is set to the number of edges in the network, i.e., 
J2 e B(e) — M. The BC scheme is also investigated in [22], 
but it is not treated as an optimization problem because the 
total link bandwidth is not fixed. 

Thus, if the network topology is predetermined, a de- 
signing scheme is a combination of routing algorithm and 
link bandwidth allocation scheme. When network topology is 
undetermined, it becomes another ingredient of the designing 
scheme. We investigate five network topologies in this paper: 



IEEE/ACM TRANSACTIONS ON NETWORKING 



4 



TABLE II: Theoretical result of the critical packet-generating rate R c under different combinations for networks in Table U 
The result is obtained from Equation [2] For each kind of network, we generate 10 instances, and the result is the average of 
the 10 instances. 



(bandwidth allocation scheme, routing algorithm) 


BA 


PA 


HOT 


ER 


WS 


(UC, SPR) 


88.5 


92.8 


99.1 


284.5 


111.1 


(UC, EFR) 


264.3 


195.3 


94.6 


390.6 


147.9 


(BC, SPR) 


1079.4 


1192.2 


1001.7 


937.9 


610.2 


(EBC, EFR) 


766.6 


844.9 


909.2 


891.4 


606.2 



TABLE III: Simulation result of the critical packet-generating rate R c under different combinations for BA, PA, HOT, ER 
and WS networks. For each kind of network, 10 instances are generated and for each instance, we run ten simulations. The 
result here is the average over all the simulations. Compared with Table [TTT1 it is shown that the simulation result is roughly 



(bandwidth allocation scheme, routing algorithm) 


BA 


PA 


HOT 


ER 


WS 


(UC, SPR) 


110.2 


115.6 


119.9 


370.5 


136.7 


(UC, EFR) 


320.4 


249.6 


114.6 


519.8 


174.7 


(BC, SPR) 


934.2 


1042.7 


840.8 


844.2 


517.0 


(EBC, EFR) 


678.0 


741.0 


768.2 


809.9 


519.9 



TABLE IV: C max under different combinations of network topologies, routing algorithms and bandwidth allocation schemes. 
It is evident that betweenness based link bandwidth allocation scheme often incurs high C max , especially for heterogenous 
networks. Although (EBC,EFR) can assure much lower C max for BA-like scale-free networks, the reduction is not apparent 
for HOT network. 



(bandwidth allocation scheme, routing algorithm) 


BA 


PA 


HOT 


ER 


WS 


(UC, *) 
(BC, *) 
(EBC, EFR) 


1 

12.43 
2.91 


1 

12.99 
4.45 


1 

10.30 
9.70 


1 

3.37 
2.40 


1 

5.63 
4.13 



BA [6], HOT [29], ER [33], WS [34], and PA. BA network is 
constructed according to the standard BA model with m = 2. 
ER network is constructed by the G n (p) model with the 
constraint of connectedness. WS network is built from the 
ring by randomly rewiring 15 percent of its edges, also with 
the constraint of connectedness. PA is indeed a variant of the 
BA model, and is generated by the following process: begin 
with 3 fully connected nodes, and add one new node to the 
graph in successive steps, such that this new node is connected 
to the existing nodes with probability proportional to the 
current node degree, and finally, add some internal edges to 
augment the graph by selecting both endpoints with probability 
proportional to the current node degree. The main difference 
between PA and BA is that PA has the rich-club structure [30], 
while BA does not. Finally, the HOT network is a heuristically 
optimal topology for the Internet router-level network, which 
can be roughly partitioned into three hierarchies: the low 
degree core routers, the high degree gateway routers hanging 
from the core routers, and the low degree periphery nodes con- 
nected with the gateway routers. The HOT networks generated 
here follow the same degree distributions as the corresponding 
PA networks. The basic network properties of these networks 
of size 1200 are presented in Table [] 

V. The cartesian coordinate system and 

ACHIEVABLE AREAS 

For a given network, the effect of a designing scheme can 
be mapped to a coordinate in the cartesian coordinate system, 



with Cmax being the X-axis and R c being the y-axis. A 
natural question is what are the value domains of C max and 
R c . 

It is easy to see that C ma:E 's value range is [1, M]. The 
optimal (smallest) value is achieved with uniform bandwidth 
allocation scheme, and the largest is achieved by allocating all 
the bandwidth to a single edge. 

Regarding R c , the lower bound is 0, which can be achieved 
by some loop based routing algorithms, that is, the packets are 
always looping in the network. 

The upper bound R c is given by the following Theorem. 

Theorem 1. Given a network G, the upper bound R c for any 
designing scheme is 2M/L, where L is the average shortest 
path length of G; and, this upper bound is only achieved with 
(BC, SPR). 

Proof: The general idea of the proof is that the number 
of packets generated at each time step cannot exceed the 
maximal number of packets the network can consume at each 
time step. Since the total link bandwidth is M, the sum of 
interface bandwidth is 2M, which means the network can at 
most forward 2M packets one step towards their destinations. 
Because each packet is generated with random source and 
destination, the expected number of steps needed to move 
one packet from the source to its destination under routing 
algorithm V is L r , which is equivalent to say that the network 
can consume at most 2M/L r packets at each time step. So, 
we have 

R c < 2M/L r < 2M/L (3) 
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which means 2M/L is an upper bound of R c 

Then we prove that (BC, SPR) can indeed achieve this upper 
bound R c . Recall that R c = min e ^E 
(BC, SPR), can be rewritten as: 



2C{e)N(N-l) 
We) 



which in 



R c = min e&E 



min e( zE 



2C(e)N(N - 1) 

W) 

2B(e)/j: feE B(f)xMxN(N-l) 
B(e) 



_ 2MN(N - 1) 

_ 2MN(N - 1) 
~ N(N-1)L 
_ 2M 
~ 17 

This proves that the upper bound 2M/L is achievable. 

Finally, we prove that (BC,SPR) is the only scheme that 
achieves R c = 2M/L. According to the Inequality [3] only 
when two conditions hold simultaneously can R c reach 2M/L. 
These two conditions are: (1)L T — L, which implies that the 
routing algorithm is SPR, and (2) the network can on average 
move 2M packets one step toward their destinations. Because 
in SPR, the expected number of packets arriving at interface Vi 
in free flow state is exactly | j^n^T) ' wnere e i s incident edge 
of interface When R = 2M/L, the average total number 
of packets in the network is ^ e ^j^-i) = 2M. So, for the 
whole network to move 2M packets at each time step, each 
interface should exactly move j^p^rftE packets at each time 
step, which corresponds to the BC link bandwidth allocation 
scheme. ■ 

In fact, (BC, SPR) reflects the conventional basic philos- 
ophy of real-world empirical network designing. Real-world 
network routing protocols such as OSPF and BGP are often 
based on shortest path routing(or at least making path length 
a major decision factor in path selection). High bandwidth 
links are placed at key points of the network, and when 
some links get overburdened, they will be upgraded by links 
with higher bandwidth, potentially attracting even more traffic 
and incuring a vicious cycle in traffic demanding and link 
bandwidth upgrading. 

Theorem Q] also implies that the optimal R c and optimal 
C m ax cannot be achieved at the same scheme, except for com- 
plete homogenous regular networks, indicating the presence 
of a tradeoff issue, which is confirmed by Table EE with the 
simulation result of R c presented in Table [ITTb and Table ES] 

It can be seen that although (BC,SPR) can guarantee highest 
R c , it often incurs high C max , because C max = n{n-i)l is 
proportional to the largest betweenness, which is especially 
large for heterogenous networks. Hence, (BC,SPR) is not 
a cost-effective design for heterogenous networks. A conse- 
quently natural question is what is the achievable R c with the 

lowest Cmax- 

With (UC, SPR), it is easy to show that R c = 2 ^~ 1) . For 
regular networks such as ring and toroidal lattice, (UC, SPR) 
achieves the same R c as (BC,SPR), however, for heterogenous 
networks, B max can be quite large so that its R c will be 



significantly dwarfed compared with R c in (BC, SPR)(see 
Table EH and Fig. [2(a)}. 

In BA-like scale-free networks, this R c can be improved by 
changing the routing algorithm from SPR to EFR (see Table ED 
and Table En). Indeed, the R c in (UC,EFR) is R c = ^fe^ , 

where Bf^^ R ' is the largest effective betweenness centrality 
among all edges. While EFR can significantly improve BA-like 
network's transmission capacity because -Bn^x is substan- 
tially decreased compared with B max , it is ineffective for the 
more realistic HOT model and WS network, and less effective 
for the ER network(see Table ED- 

It is evident that there is an optimal R c at the lowest C max . 
It is easy to demonstrate the existence of such a routing 
algorithm that will realize this R c , but it is not an easy task to 
find a practically implementable routing algorithm. Recall that 
so the essence to realize the optimal 



with UC, R r = 



_ gjVfjV— 1) 



R c at UC is to fin3 a routing algorithm that minimizes the 
B max , which is equivalent to find a path set of size N(N — 1), 
containing exactly one path for each node pair, such that the 
maximal per edge occurrence in this path set is minimized. 
The intended path set can be found by firstly computing all 
the simple paths between any node pairs, and then enumerating 
all combinations of such path set. However, this approach in 
reality is infeasible for even median sized networks. A greedy 
heuristic algorithm that works can be as follows: 

1) compute all the simple paths between any node pairs; 

2) construct an initial path set, for example, the path set 
induced by the shortest path routing or efficient routing; 

3) for any node pair, if there is a path such that if replacing 
this path with the present one in the path set will reduce 
the maximal per edge occurrence, replace the present 
path with this path. Repeat this step until there is no 
such path. 

In reality, this routing algorithm is impractical. The signif- 
icance of this scheme is that it is a key point in shaping the 
skeleton of the achievable area. 

An illustration of the achievable area for BA-like scale- 
free networks is shown in Fig. |l(a)| in which the four typical 
designing schemes are marked. The other three points A, B 
and C, which are important for a conceptual bounding of the 
achievable area are also illustrated, where A is a scheme that 
combines uniform link bandwidth allocation scheme and some 
routing algorithm that keeps the packet always looping around 
the network, B is a scheme that allocates all the link bandwidth 
to a single edge so that the network's transmission capacity 
also approaches zero, and C is a scheme whose existence 
is mentioned above that realizes the optimal R c value with 
uniform link bandwidth allocation scheme. 

Conceptually, A, B, C and (BC, SPR) together roughly 
define the skeleton of the achievable area. However, the shapes 
may vary dramatically for different networks. For example, 
in regular networks such as the ring or toroidal lattice, the 
achievable area for network designing schemes looks like 
Fig |l(b)| in which all the four schemes discussed in this paper 
as well as C will converge to the single point C. And (UC,EFR) 
will not always top (UC,SPR), such as in WS network. 

From an engineering perspective, a relevant problem is 



IEEE/ACM TRANSACTIONS ON NETWORKING 



6 





(a) Rc (b) Cmax 

Fig. 2: (Color online) Scalability of (a)R c values of BA, HOT and ER under four different configurations; and (b) C max values 
of BA and ER under (BC,SPR) and (EBC, EFR) schemes. 



whether there exists a designing scheme that achieves good 
tradeoff between R c and C max . The answer is that this 
property heavily depends on the particular network topology. 
However, in many cases, such designing exists. This paper 
proposes one such designing, (EBC, EFR), which proves to 
be cost-effective for most small-world networks, especially for 
BA-like scale-free networks. 

To see why this scheme has the potential of achieving good 



tradeoff, first note that with (EBC, EFR), R c = 



2M 

L(EFR) 



and, 



N(N-1)L( EFR 1 



So, we have: 



R (BC,SPR) L (EFR) 



(EBC, EFR) 



R 



L 



and 



r (BC,SPR) r(EFR) 



B n 



r ,{EBC,EFR) 



(EFR) 



B 



In most networks, L^ EFR ) is only slightly longer than L 
|(see Fig|3(b)lforthe average path length of different routing 

2 This also ensures that the path length overhead incurred is also acceptable 
for all the networks investigated in this paper. 



algorithms in different networks), so the R c in (EBC,EFR) is 
only slightly smaller than the optimal one. However, C m ax 
in (BC,SPR) is L T 



times larger than the C n 



in (EBC,EFR). This expresses why the cost-effectiveness of 
(EBC,EFR) is topology-dependent. 

In HOT network, although with the same skewed degree 
distribution, its rigid hierarchy structure restricts the flexibility 
in path selection, making it insensitive to routing algorithm 
changes, so B max and Bmax show no big difference. In WS 
network, the near uniform degree distribution makes SPR and 
EFR nearly identical routing algorithms, so B max and Bmax R ^ 
also show no difference. The difference between HOT and WS 
is that in HOT network, C max is high, while, in WS, C max 
is relatively low. 

Scale-free networks such as BA and PA, on the other hand, 
have the inherent flexibility in path selection and show strong 
sensitivity to routing algorithm changes. In these networks, 
B m ax can be an order of magnitude larger than Bm a F x R \ 
which makes C max in (EBC,EFR) quite smaller compared 
with (BC,SPR). 

ER network's random structure makes it lies somewhere 
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(a) B max (b) APL 

Fig. 3: (Color online) (a) Scalability of the maximal betweenness and effective betweenness for BA and ER networks. With SPR, 
the F-axis denotes the value of maximal betweenness, and with EFR, the F-axis denotes the maximal effective betweenness. 
The line fittings for these data are also presented, and (b) average path length for the five networks with different size under 
both shortest path routing and efficient routing. The filled shape represents the SPR, while the hollow shape represents the 
EFR. 



between the WS and BA network. For ER network, both 
(BC,SPR) and (EBC,EFR) are good designing choices. Even 
(UC,SPR) and (UC,EFR) also achieves relatively high R c 
values compared with other networks. 

VI. Scalability of network designing schemes 

The scalability is investigated by the growth trends of R c 
and C max when network evolves. Conceptually, the scalability 
can be viewed as a third dimension of the designing objective, 
i.e., the time dimension. Figure |2(a)| presents the R c values 
for BA and ER networks under four different settings and 
Figure |2(b)| presents the C max values for the five networks 
under (BC, SPR) and (EBC, EFR). It is shown that under 
(UC, SPR), BA network's R c value remains quite stable, i.e., 
almost not scaling with N . This is because, B max scales super 
linearly with N, as illustrated in Fig. |3(a)| Numerical fitting on 
simulation result shows that B max ~ N 1 - 711 in BA network, 
whereas in ER network, B max ~ ]\T 1-338 . On the other hand, 
Bmax grows much slowly as network expands, ~ N 1 - 227 
and ~ iV 1 137 for BA and ER respectively. This means with 
(UC, SPR), BA network's R c scales very slowly, while ER 
network's R c scales much better. However, with (UC, EFR) 
the scalability of R c significantly improves for BA network. 

Regarding Cmax, with (BC, SPR), C ma x oc B max , so 
it grows fast for BA, HOT and PA networks, and much 
more slowly for ER and WS networks. With (EBC, EFR), 
Cmax oc Bmax R \ which scales much slowly in BA and 
PA networks, but still grows fast in HOT network. This is 
evidenced in Fig |2(b)| where C max grows fast with (BA, BC, 
SPR), (PA, BC, SPR) and (HOT, BC, SPR) and (HOT, EBC, 
EFR), but remains nearly stable for other settings. 

In short, the above analysis show that (BC,SPR) is not a 
scalable designing for heterogenous networks. On the other 
hand, our proposed (EBC, EFR) is a cost-effective designing 
that is scalable for most of the networks with small-world 



properties, except for heterogenous networks with rigid hierar- 
chy structure, and ER network shows the potential of achieving 
scalable cost-effective designing under any settings. 

VII. Conclusions 

By proposing that network designing is not a single ob- 
jective optimization process, but rather a multi-objective opti- 
mization process, this paper established a scientific foundation 
for network designing studies. It has been found that typical 
network designing objectives such as network transmission 
capacity and technical feasibility are often contradictive, which 
indicates the presence of a tradeoff issue. We defined a two 
dimensional cartesian system to evaluate the effect of each 
designing scheme. 

In particular, the empirical network designing approach 
widely adopted today is incapable to meet the cost-effective 
and scalable requirements. The existence of cost-effective 
design depends heavily on the network topology. For BA-like 
scale-free networks, efficient routing combined with effective 
betweenness based link bandwidth allocation scheme can be 
regarded as a cost-effective design, and this property also 
scales as network evolves. If the designer has the full freedom 
of choosing all designing ingredients, then ER network shows 
pretty good characteristic in achieving a cost-effective design 
under most settings. 

We believe the scientific framework as well as the specific 
findings will give insightful help to advance the science of 
network designing and engineering. Evaluating the perfor- 
mance of other designing schemes under this framework, and 
finding the exact boundary of the achievable area, for example, 
whether the boundary is concave or convex for different 
networks, are the direction for future work. 
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